CDS

Accession Number TCMCG075C18215
gbkey CDS
Protein Id XP_017976083.1
Location join(38380988..38381146,38381225..38381362,38381500..38381581,38381919..38381995,38382075..38382190,38382282..38382399,38382542..38382658,38382770..38382861,38382950..38383016,38383098..38383268,38383374..38383517,38383593..38383739)
Gene LOC18600477
GeneID 18600477
Organism Theobroma cacao

Protein

Length 475aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018120594.1
Definition PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category J
Description Heparan-alpha-glucosaminide N-acetyltransferase-like
KEGG_TC -
KEGG_Module M00078        [VIEW IN KEGG]
KEGG_Reaction R07815        [VIEW IN KEGG]
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K10532        [VIEW IN KEGG]
EC 2.3.1.78        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00531        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko04142        [VIEW IN KEGG]
map00531        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCTTTCAGAATGGCTGAAATCAAAGCAGAACCTGCTCAACGTCATACTCTGGCCATCCCCATGGCCGACGACTCAGCTCAGAAGCCCAATAAAACTCAGCGTGTTGCCTCACTTGACATTTTCAGAGGCCTTACTGTGGCGGTACTCTTCATTTTCTCTGATGATTCTAGTAGATGATGCTGGAGGAGAGTGGCCTGTGATTGGTCATGCACCATGGCACGGCTGCAACCTTGCCGATTTCGTTATGCCCTTCTTTCTTTTCATTGTTGGCATGGCCATTCCTCTTGCTCTTAAGAGAATACCAGGTAAAGGCAAAGCTATCCAGAAGGTGGGTTTCAGAACTTTGAAGCTCCTCTTTTGGGGTCTCTTACTACAAGGAGGGTATTCTCATGCTCCTGACAAGCTAACATATGGCGTTGATATGAAAATGATAAGATTCTGTGGCATTCTACAGAGAATAGCTTTTGCATATCTGGTAGTGGCACTTGCAGAGATTTTTCTGAAAGATGCACAACCCAAAGATGTTTCAGCTGGTCATTTCTCTGTGTTCAGGTTATACTGTTGGCATTGGCTGGTGGGTGCATGCATACTTATTATGTACTTGGCTTTACTTTATGGAACATATGTTCCTGACTGGCAGTTCACTGTCCAAAATAAGGACAGTGCTGATTATGGGAAGGTTTTCACTGTAGCCTGCAATGTGAGAGGAAAACTGGATCCTCCTTGCAATGCTGTGGGATATATTGACAGAGAAGTATTAGGGATCAATCACATGTACCAACGACCAGCATGGAGAAGATCCAGGGCTTGCACTGTGAATTCCCCTTATGAGGGACCATTTAAAGATGCCGCTCCATCATGGTGCCATGCACCCTTCGAACCTGAAGGGATTCTAAGTTCAATATCTGCTGTTCTTTCTACAATCATTGGAGTCCATTTTGGGCATGTGCTTGTACATTTGAAGGGTCATTCTGAAAGACTGAGGCAGTGGATCATGATGGGAATTGCTCTCCTTATTCTTGGAATTGTTCTACATTTCACAGCGATTCCTTTGAATAAACAGCTATACACTTTCAGCTATGTTTGTGTAACATCTGGAGCAGCAGCACTTGTTTTCTCAGCCATCTATATCCTGGTTGATATTTGGGATCTGAAGCTGGTGTTTCTGCCATTGAAATGGATTGGCATGAATGCCATGCTGGTTTATGTTATGGCAGCTGAAGGGATCTTTGCAGGTTTCATCAATGGATGGTACTACCAGGATCCACATAATACACTGGTATATTGGATTCAAAAGCACATATTCATTGGGGTTTGGCATTCAAGAAGAGTAGGCATTCTGCTCTATGTTATATTTGCAGAGATCCTCTTCTGGGCTATCATTGCAGGCATTTTGCATCGATCAGGAATTTATTGGAAGCTTTGA
Protein:  
MLSEWLKSKQNLLNVILWPSPWPTTQLRSPIKLSVLPHLTFSEALLWRYSSFSLMILVDDAGGEWPVIGHAPWHGCNLADFVMPFFLFIVGMAIPLALKRIPGKGKAIQKVGFRTLKLLFWGLLLQGGYSHAPDKLTYGVDMKMIRFCGILQRIAFAYLVVALAEIFLKDAQPKDVSAGHFSVFRLYCWHWLVGACILIMYLALLYGTYVPDWQFTVQNKDSADYGKVFTVACNVRGKLDPPCNAVGYIDREVLGINHMYQRPAWRRSRACTVNSPYEGPFKDAAPSWCHAPFEPEGILSSISAVLSTIIGVHFGHVLVHLKGHSERLRQWIMMGIALLILGIVLHFTAIPLNKQLYTFSYVCVTSGAAALVFSAIYILVDIWDLKLVFLPLKWIGMNAMLVYVMAAEGIFAGFINGWYYQDPHNTLVYWIQKHIFIGVWHSRRVGILLYVIFAEILFWAIIAGILHRSGIYWKL